Short Text Classification Using Contextual Analysis

نویسندگان

چکیده

Micro blogging tools provide a real time service for the public to express opinions, broadcast news and information offer an opportunity comment respond such output. Word usage in social media is continually evolving. bloggers may use different sets of words describe specific event they new (i.e. neither exist training dataset nor informal or formal dictionaries) contexts. Dynamically capturing their potential meaning from context can help reflect relationship media, which then be useful solving various problems, like classification task. Different approaches have been proposed this regard, one them Contextual Analysis. This paper focuses on examining approach grouping short texts (tweets) talking about same into category. A transparent method text multi-class categorization presented. It uses Analysis capture most important detect similar In order test efficacy these areas, study evaluates performance other well known methods, as Naïve Bayes, Support Vector Machines, K-Nearest Neighbors Convolutional Neural Networks. On average, experiments’ results show that effectively categorize tweets groups, with high f1-measure score f1>97.09% f1>95.27%, imbalanced classes number experiments, respectively. However, baseline negatively influenced by dataset. The Networks produces best among algorithms f1>97.74% all 1.73% 2.72% higher than lowest Naive Bayes Neighbors, respectively, but does not meet requirements transparency results.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Uyghur Short Text Classification Using Morphological Information

In this paper, we propose a novel method for improving the classification performance of short text strings using conditional random fields (CRFs) that combine morphological information. Experimental results on three datasets (Uyghur, Chinese, and English) demonstrate that our method can yield higher classification accuracy than Support Vector Machine (SVM) classifier and Maximum Entropy Model ...

متن کامل

Improving Text Classification by Using Conceptual and Contextual Features

The exponential growth of text available on the Internet has created a critical need for accurate, fast, and generalpurpose text classification algorithms. This paper examines the improvement of broadly based text classification by using features that are easily extracted from training documents. These features represent both the conceptual and contextual properties of a target class, and inclu...

متن کامل

Contextual feature selection for text classification

We present a simple approach for the classification of ‘‘noisy’’ documents using bigrams and named entities. The approach combines conventional feature selection with a contextual approach to filter out passages around selected features. Originally designed for call for tender documents, the method can be useful for other web collections that also contain non-topical contents. Experiments are c...

متن کامل

Contextual Rules for Text Analysis

In this paper we describe a rule-based formalism for the analysis and labelling of texts segments. The rules are contextual rewriting rules with a restricted form of negation. They allow to underspecify text segments not considered relevant to a given task and to base decisions upon context. A parser for these rules is presented and consistence and completeness issues are discussed. Some result...

متن کامل

Topic Modeling and Classification of Cyberspace Papers Using Text Mining

The global cyberspace networks provide individuals with platforms to can interact, exchange ideas, share information, provide social support, conduct business, create artistic media, play games, engage in political discussions, and many more. The term cyberspace has become a conventional means to describe anything associated with the Internet and the diverse Internet culture. In fact, cyberspac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2021

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2021.3125768